Supporting Priorities and Improving Utilization of the IBM SP Scheduler Using Slack-Based Backfilling
نویسندگان
چکیده
Distributed memory parallel systems such as the IBM SP2 execute jobs using variable partitioning. Scheduling jobs in FCFS order leads to severe fragmentation and utilization loss, which lead to the development of backfilling scheudlers such as EASY. This paper presents a backfilling scheduler that improves EAST in two ways: It supports both user selected and administrative priorities, and guarantees a bounded wait time for all jobs. The scheduler gives each waiting job a slack, which determines how long it may have to wait before running: ‘important’ and ‘heavy’ jobs will have little slack in comparison with others. Experimental results show that the priority scheduler reduces the average wait time by about 15% relative to EASY in an equal priorities scenario, and is responsive to differential priorities as
منابع مشابه
Supporting Priorities and Improving Utilization of the IBM SP2 Scheduler Using Slack Based Backfilling
Running jobs on the IBM SP2, as in most distributed memory parallel system in the market today, is done by giving each job a subset of the available processors for its exclusive use. Scheduling jobs in FCFS order suffers from severe fragmentation that leads to utilization loss. This led Argonne National Lab, where the first large SP1 was installed, to develop the EASY scheduler, which has since...
متن کاملSupporting Priorities and Improving Utilization of the IBM SP2 Scheduler Using Slack-Based Back lling
Running jobs on the IBM SP2, as in most distributed memory parallel system in the market today, is done by giving each job a subset of the available processors for its exclusive use. Scheduling jobs in FCFS order suuers from severe fragmentation that leads to utilization loss. This led Argonne National Lab, where the rst large SP1 was installed, to develop the EASY scheduler, which has since th...
متن کاملUtilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling
ÐScheduling jobs on the IBM SP2 system and many other distributed-memory MPPs is usually done by giving each job a partition of the machine for its exclusive use. Allocating such partitions in the order in which the jobs arrive (FCFS scheduling) is fair and predictable, but suffers from severe fragmentation, leading to low utilization. This situation led to the development of the EASY scheduler...
متن کاملBackfilling with lookahead to optimize the packing of parallel jobs
The utilization of parallel computers depends on how jobs are packed together: if the jobs are not packed tightly, resources are lost due to fragmentation. The problem is that the goal of high utilization may conflict with goals of fairness or even progress for all jobs. The common solution is to use backfilling, which combines a reservation for the first job in the interest of progress with pa...
متن کاملScheduling Algorithms to Improve Utilization in Toroidal-Interconnected Systems
BlueGene/L is a massively parallel cellular architecture system with a toroidal interconnect, currently being developed at the IBM T.J. Watson Research Center. Cellular architectures with a toroidal interconnect are effective at producing highly scalable computing systems, but typically require job partitions to be both rectangular and contiguous. These restrictions introduce fragmentation issu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999